(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
TGACCTCGGG GCAGGTCCTG GTGCAGAGCG TCGCCAAGGA CGCCGAGAGG GAGGCGGGAT 60
TGCCCAGACA TCCTTCAGCG AAGTGCATGT GTGTTTGTAA ACCATCGTTG GCTGTCGGGA 120
GACCGCGAdG ACCGGTCCAG GCTGCGGCGG AGTCGAGGGC GAGGGAGAGG CCGCGTGAGT 180

GAGCAGAGTC CAGAGCCGTG CGCCCCCAGA ACTGCGCGTC CGCCCCGTGC ACCCCCGCGC 240

GCCATGCCCA GTTGCCCCGC GCGCTCTGCT ACGGGCCCGC TCTCCATCAT GGGCCTCATG 300

CCGCTCACCA AGGAGGTGGC CAAGGGCAGC ATCGGGCGCG GTGTGCTCCC CGCCGTGGAA 360

CTGGCCATCG AGCAGATCCG CAACGAGTCA CTCCTGCGCC CCTACTTCCT CGACCTGCGG 420

CTCTATGACA CGGAGTGCGA CAACGCAAAA GGGTTGAAAG CCTTCTACGA TGCGATAAAA 480

TACGGGCCGA ACCACTTGAT GGTGTTTGGA GGCGTCTGTC CATCCGTCAC ATCCATCATT 540

GCAGAGTCCC TCCAAGGCTG GAATCTGGTG CAGCTTTCTT TTGCTGCAAC CACGCCTGTT 600

CTAGCCGATA AGAAAAAATA CCCTTATTTC TTTCGGACCG TCCCATCAGA CAATGCGGTG 660

AATCGAGCCA TTCTGAAGTT GCTCAAGCAC TACCAGTGGA AGCGCGTGGG CACGCTGACG 720

CAAGACGTTC AGAGGTTCTC TGAGGTGCGG AATGACCTGA CTGGAGTTCT GTATGGCGAG 780

GACATTGAGA TTTCAGACAC CGAGAGCTTC TCCAACGATC CCTGTACCAG TGTCAAAAAG 840

CTGAAGGGGA ATGATGTGCG GATCATCCTT GGCCAGTTTG ACCAGAATAT GGCAGCAAAA 900

GTGTTCTGTT GTGCATACGA GGAGAACATG TATGGTAGTA AATATCAGTG GATCATTCCG 960

GGCTGGTACG AGCCTTCTTG GTGGGAGCAG GTGCACACGG AAGCCAACTC ATCCCGCTGC 1020

CTCCGGAAGA ATCTGCTTGC TGCCATGGAG GGCTACATTG GCGTGGATTT CGAGCCCCTG 1080

AGCTCCAAGC AGATCAAGAC CATCTCAGGA AAGACTCCAC AGCAGTATGA GAGAGAGTAC 1140

AACAACAAGC GGTCAGGCGT GGGGCCCAGC AAGTTCCACG GGTACGCCTA CGATGGCATC 1200

TGGGTCATCG CCAAGACACT GCAGAGGGCC ATGGAGACAC TGCATGCCAG CAGCCGGCAC 1260

CAGCGGATCC AGGACTTCAA CTACACGGAC CACACGCTGG GCAGGATCAT CCTCAATGCC 1320

ATGAACGAGA CCAACTTCTT CGGGGTCACG GGTCAAGTTG TATTCCGGAA TGGGGAGAGA 1380

ATGGGGACCA TTAAATTTAC TCAATTTCAA GACAGCAGGG AGGTGAAGGT GGGAGAGTAC 1440

AACGCTGTGG CCGACACACT GGAGATCATC AATGACACCA TCAGGTTCCA AGGATCCGAA 1500

CCACCAAAAG ACAAGACCAT CATCCTGGAG CAGCTGCGGA AGATCTCCCT ACCTCTCTAC 1560

AGCATCCTCT CTGCCCTCAC CATCCTCGGG ATGATCATGG CCAGTGCTTT TCTCTTCTTC 1620

AACATCAAGA ACCGGAATCA GAAGCTCATA AAGATGTCGA GTCCATACAT GAACAACCTT 1680

ATCATCCTTG GAGGGATGCT TTCCTATGCT TCCATATTTC TCTTTGGCCT TGATGGATCC 1740

TTTGTCTCTG AAAAGACCTT TGAAACACTT TGCACCGTCA GGACCTGGAT TCTCACCGTG 1800

GGCTACACGA CCGCTTTTGG GGCCATGTTT GCAAAGACCT GGAGAGTCCA CGCCATCTTC 1860

AAAAATGTGA AAATGAAGAA GAAGATCATC AAGGACCAGA AACTGCTTGT GATCGTGGGG 1920

GGCATGCTGC TGATCGACCT GTGTATCCTG ATCTGCTGGC AGGCTGTGGA CCCCCTGCGA 1980

AGGACAGTGG AGAAGTACAG CATGGAGCCG GACCCAGCAG GACGGGATAT CTCCATCCGC 2040



CCTCTCCTCG AGCACTGTGA GAACACCCAT ATGACCATCT GGCTTGGCAT CGTCTATGCC 2100

TACAAGGGAC TTCTCATGTT GTTCGGTTGT TTCTTAGCTT GGGAGACCCG CAACGTCAGC 2160

ATCCCCGCAC TCAACGACAG CAAGTACATC GGGATGAGTG TCTACAACGT GGGGATCATG 2220

TGCATCATCG GGGCCGCTGT CTCCTTCCTG ACCCGGGACC AGCCCAATGT GCAGTTCTGC 2280

ATCGTGGCTC TGGTCATCAT CTTCTGCAGC ACCATCACCC TCTGCCTGGT ATTCGTGCCG 2340

AAGCTCATCA CCCTGAGAAC AAACCCAGAT GCAGCAACGC AGAACAGGCG ATTCCAGTTC 2400

ACTCAGAATC AGAAGAAAGA AGATTCTAAA ACGTCCACCT CGGTCACCAG TGTGAACCAA 2460

GCCAGCACAT CCCGCCTGGA GGGCCTACAG TCAGAAAACC ATCGCCTGCG AATGAAGATC 2520

ACAGAGCTGG ATAAAGACTT GGAAGAGGTC ACCATGCAGC TGCAGGACAC ACCAGAAAAG 2580

ACCAGCTACA TTAAACAGAA CCACTACCAA GAGCTCAATG ACATCCTCAA CCTGGGAAAC 2640

TTCACTGAGA GCACAGATGG AGGAAAGGCC ATTTTAAAAA ATCACCTCGA TCAAAATCCC 2700

CAGCTACAGT GGAACACAAC AGAGCCCTCT CGAACATGCA AAGATCCTAT AGAAGATATA 2760

AACTCTCCAG AACACATCCA GCGTCGGCTG TCCCTCCAGC TCCCCATCCT CCACCACGCC 2820

TACCTCCCAT CCATCGGAGG CGTGGACGCC AGCTGTGTCA GCCCCTGCGT CAGCCCCACC 2880

GCCAGCCCCC GCCACAGACA TGTGCCACCC TCCTTCCGAG TCATGGTCTC GGGCCTGTAA 2940

GGGTGGGAGG CCTGGGCCCG GGGCCTCCCC CGTGACAGAA CCACACTGGG CAGAGGGGTC 3000

TGCTGCAGAA ACACTGTCGG CTCTGGCTGC GGAGAAGCTG GGCACCATGG CTGGCCTCTC 3060

AGGACCACTC GGATGGCACT CAGGTGGACA GGACGGGGCA GGGGGAGACT TGGCACCTGA 3120

CCTCGAGCCT TATTTGTGAA GTCCTTATTT CTTCACAAAG AAGAGGAACG GAAATGGGAC 3180

GTCTTCCTTA ACATCTGCAA ACAAGGAGGC GCTGGGATAT CAAACTTGCA AAAAAAAAAA 3240

AAAA 3244 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 898 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Met Pro Ser Cys Pro Ala Arg Ser Ala Thr Gly Pro Leu Ser Ile Met
1 5 10 15

Gly Leu Met Pro Leu Thr Lys Glu Val Ala Lys Gly Ser Ile Gly Arg
20 25 30

Gly Val Leu Pro Ala Val Glu Leu Ala Ile Glu Gln Ile Arg Asn Glu
35 40 45

Ser Leu Leu Arg Pro Tyr Phe Leu Asp Leu Arg Leu Tyr Asp Thr Glu
50 55 60

Cys Asp Asn Ala Lys Gly Leu Lys Ala Phe Tyr Asp Ala Ile Lys Tyr
65 70 75 80

Gly Pro Asn His Leu Met Val Phe Gly Gly Val Cys Pro Ser Val Thr
85 90 95

Ser Ile Ile Ala Glu Ser Leu Gln Gly Trp Asn Leu Val Gln Leu Ser
100 105 110

Phe Ala Ala Thr Thr Pro Val Leu Ala Asp Lys Lys Lys Tyr Pro Tyr
115 120 125

Phe Phe Arg Thr Val Pro Ser Asp Asn Ala Val Asn Pro Ala Ile Leu
130 135 140

Lys Leu Leu Lys His Tyr Gln Trp Lys Arg Val Gly Thr Leu Thr Gln
145 150 155 160

Asp Val Gln Arg Phe Ser Glu Val Arg Asn Asp Leu Thr Gly Val Leu
165 170 175

Tyr Gly Glu Asp Ile Glu Ile Ser Asp Thr Glu Ser Phe Ser Asn Asp
180 185 190

Pro Cys Thr Ser Val Lys Lys Leu Lys Gly Asn Asp Val Arg Ile Ile
195 200 205

Leu Gly Gln Phe Asp Gln Asn Met Ala Ala Lys Val Phe Cys Cys Ala
210 215 220

Tyr Glu Glu Asn Met Tyr Gly Ser Lys Tyr Gln Trp Ile Ile Pro Gly
225 230 235 240

Trp Tyr Glu Pro Ser Trp Trp Glu Gln Val His Thr Glu Ala Asn Ser
245 250 255

Ser Arg Cys Leu Arg Lys Asn Leu Leu Ala Ala Met Glu Gly Tyr Ile
260 265 270

Gly Val Asp Phe Glu Pro Leu Ser Ser Lys Gln Ile Lys Thr Ile Ser
275 280 285

Gly Lys Thr Pro Gln Gln Tyr Glu Arg Glu Tyr Asn Asn Lys Arg Ser
290 295 300

Gly Val Gly Pro Ser Lys Phe His Gly Tyr Ala Tyr Asp Gly Ile Trp
305 310 315 320

Val Ile Ala Lys Thr Leu Gln Arg Ala Met Glu Thr Leu His Ala Ser
325 330 335

Ser Arg His Gln Arg Ile Gln Asp Phe Asn Tyr Thr Asp His Thr Leu
340 345 350



Gly Arg Ile Ile Leu Asn Ala Met Asn Glu Thr Asn Phe Phe Gly Val
355 360 365

Thr Gly Gln Val Val Phe Arg Asn Gly Glu Arg Met Gly Thr Ile Lys
370 375 380

Phe Thr Gln Phe Gln Asp Ser Arg Glu Val Lys Val Gly Glu Tyr Asn
385 390 395 400

Ala Val Ala Asp Thr Leu Glu Ile Ile Asn Asp Thr Ile Arg Phe Gln
405 410 415

Gly Ser Glu Pro Pro Lys Asp Lys Thr Ile Ile Leu Glu Gln Leu Arg
420 425 430

Lys Ile Ser Leu Pro Leu Tyr Ser Ile Leu Ser Ala Leu Thr Ile Leu
435 440 445

Gly Met Ile Met Ala Ser Ala Phe Leu Phe Phe Asn Ile Lys Asn Arg
450 455 460

Asn Gln Lys Leu Ile Lys Met Ser Ser Pro Tyr Met Asn Asn Leu Ile
465 470 475 480

Ile Leu Gly Gly Met Leu Ser Tyr Ala Ser Ile Phe Leu Phe Gly Leu
485 490 495

Asp Gly Ser Phe Val Ser Glu Lys Thr Phe Glu Thr Leu Cys Thr Val
500 505 510

Arg Thr Trp Ile Leu Thr Val Gly Tyr Thr Thr Ala Phe Gly Ala Met
515 520 525

Phe Ala Lys Thr Trp Arg Val His Ala Ile Phe Lys Asn Val Lys Met
530 535 540

Lys Lys Lys Ile Ile Lys Asp Gln Lys Leu Leu Val Ile Val Gly Gly
545 550 555 560

Met Leu Leu Ile Asp Leu Cys Ile Leu Ile Cys Trp Gln Ala Val Asp
565 570 575

Pro Leu Arg Arg Thr Val Glu Lys Tyr Ser Met Glu Pro Asp Pro Ala
580 585 590

Gly Arg Asp Ile Ser Ile Arg Pro Leu Leu Glu His Cys Glu Asn Thr
595 600 605

His Met Thr Ile Trp Leu Gly Ile Val Tyr Ala Tyr Lys Gly Leu Leu
610 615 620

Met Leu Phe Gly Cys Phe Leu Ala Trp Glu Thr Arg Asn Val Ser Ile
625 630 635 640

Pro Ala Leu Asn Asp Ser Lys Tyr Ile Gly Met Ser Val Tyr Asn Val
645 650 655

Gly Ile Met Cys Ile Ile Gly Ala Ala Val Ser Phe Leu Thr Arg Asp
660 665 670

Gln Pro Asn Val Gln Phe Cys Ile Val Ala Leu Val Ile Ile Phe Cys
675 680 685

Ser Thr Ile Thr Leu Cys Leu Val Phe Val Pro Lys Leu Ile Thr Leu
690 695 700

Arg Thr Asn Pro Asp Ala Ala Thr Gln Asn Arg Arg Phe Gln Phe Thr
705 710 715 720

Gln Asn Gln Lys Lys Glu Asp Ser Lys Thr Ser Thr Ser Val Thr Ser
725 730 735

Val Asn Gln Ala Ser Thr Ser Arg Leu Glu Gly Leu Gln Ser Glu Asn
740 745 750

His Arg Leu Arg Met Lys Ile Thr Glu Leu Asp Lys Asp Leu Glu Glu
755 760 765

Val Thr Met Gln Leu Gln Asp Thr Pro Glu Lys Thr Thr Tyr Ile Lys
770 775 780

Gln Asn His Tyr Gln Glu Leu Asn Asp Ile Leu Asn Leu Gly Asn Phe
785 790 795 800

Thr Glu Ser Thr Asp Gly Gly Lys Ala Ile Leu Lys Asn His Leu Asp
805 810 815

Gln Asn Pro Gln Leu Gln Trp Asn Thr Thr Glu Pro Ser Arg Thr Cys
820 825 830

Lys Asp Pro Ile Glu Asp Ile Asn Ser Pro Glu His Ile Gln Arg Arg
835 840 845

Leu Ser Leu Gln Leu Pro Ile Leu His His Ala Tyr Leu Pro Ser Ile
850 855 860

Gly Gly Val Asp Ala Ser Cys Val Ser Pro Cys Val Ser Pro Thr Ala
865 870 875 880

Ser Pro Arg His Arg His Val Pro Pro Ser Phe Arg Val Met Val Ser
885 890 895

Gly Leu



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
ATGGCTTCCC CGCCGAGCTC CGGGCAGCCC CGGCCGCCGC CGCCGCCGCC GCCGCCCGCG 60
CGCCTGCTGC TGCCCCTGCT GCTGTCGCTG CTGCTGTGGT TGGCGCCCGG GGCCTGGGGC 120
TGGACGCG4G GCGCCCCCCG GCCGCCGCCC AGCAGCCCGC CGCTCTCCAT CATGGGCCTC 180

ATGCCGCTCA CCAAGGAGGT GGCCAAGGGC AGCATCGGGC GCGGCGTGCT CCCCGCCGTG 240

GAGCTAGCCA TCGAGCAGAT CCGCAACGAG TCACTCCTGC GCCCCTACTT CCTGGACCTG 300

CGACTCTATG ACACCGAGTG TGACAATGCA AAGGGACTGA AAGCCTTCTA TGACGCAATA 360

AAGTATGGGC CGAACCATTT GATGGTGTTT GGAGGCGTCT GTCCGTCTGT CACATCTATT 420

ATCGCGGAGT CCCTCCAAGG CTGGAATCTG GTGCAGCTTT CCTTCGCCGC CACCACGCCT 480

GTTCTTGCGG ATAAGAAGAA GTACCCGTAT TTCTTCCGGA CGGTGCCGTC AGACAACGCG 540

GTGAACCCCG CCATCCTGAA GCTCCTGAAG CACTTCCGCT GGCGGCGTGT GGGCACACTC 600

ACGCAGGACG TGCAGCGCTT CTCCGAGGTG AGGAATGACC TGACTGGGGT TCTGTATGGG 660

GAAGATATTG AGATCTCAGA CACAGAGAGT TTCTCCAATG ATCCCTGCAC CAGCGTCAAA 720

AAGCTCAAGG GGAATGACGT GCGGATCATC CTTGGCCAGT TTGACCAGAA TATGGCAGCA 780

AAAGTCTTCT GTTGTGCCTT CGAGGAGAGC ATGTTTGGCA GCAAGTACCA GTGGATCATC 840

CCGGGATGGT ACGAGCCTGC GTGGTGGGAG CAGGTGCATG TGGAGGCCAA TTCCTCACGC 900

TGCCTGCGCA GAAGCCTCCT GGCTGCCATG GAAGGTTACA TCGGAGTGGA CTTTGAGCCC 960

CTGAGCTCCA AACAAATCAA GACCATCTCA GGGAAGACTC CACAGCAGTA TGAAAGAGAG 1020

TACAACAGCA AACGTTCAGG CGTGGGGCCC AGCAAGTTCC ATGGGTACGC CTACGATGGG 1080

ATCTGGGTCA TCGCCAAGAC CCTACAGAGG GCCATGGAGA CACTGCATGC CAGTAGCAGG 1140

CACCAGCGGA TCCAGGACTT CAACTACACA GACCACACGC TGGGCAAAAT CATCCTCAAT 1200

GCCATGAACG AGACCAACTT CTTCGGGGTC ACGGGTCAAG TTGTGTTCCG GAACGGGGAG 1260

AGAATGGGAA CCATTAAATT TACTCAATTT CAAGACAGCA GAGAGGTGAA GGTCGGCGAA 1320

TACAACGCGG TGGCTGACAC ACTGGAGATC ATCAATGACA CCATAAGGTT CCAGGGGTCC 1380

GAGCCACCCA AGGACAAGAC CATCATTCTG GAGCAGCTTC GGAAGATCTC GCTTCCACTG 1440

TATAGCATCC TGTCCGCTCT CACCATCCTC GGCATGATCA TGGCCAGCGC CTTCCTCTTC 1500

TTCAACATCA AGAACCGGAA CCAAAAGCTG ATTAAGATGT CAAGCCCCTA CATGAACAAC 1560

CTCATCATCC TGGGAGGAAT GCTGTCCTAT GCATCCATCT TCCTCTTTGG CCTCGATGGG 1620

TCCTTCGTCT CAGAAAAGAC CTTTGAAACA CTCTGCACGG TCCGGACCTG GATTCTCACC 1680

GTGGGCTACA CAACTGCCTT TGGGGCCATG TTTGCAAAGA CCTGGAGGGT CCATGCCATC 1740

TTCAAAAATG TGAAGATGAA GAAGAAGATC ATCAAAGACC AGAAGCTGCT TGTGATTGTG 1800

GGGGGCATGC TGCTCATCGA CCTGTGCATC CTGATCTGTT GGCAGGCTGT GGACCCCCTG 1860

CGGAGGACAG TAGAGAGGTA CAGCATGGAG CCGGACCCAG CAGGCCGGGA CATCTCCATC 1920

CGCCCATTGC TGGAACACTG CGAAAACACC CACATGACCA TCTGGCTTGG CATTGTCTAC 1980

GCCTACAAGG GGCTCCTCAT GCTATTCGGT TGTTTCTTGG CATGGGAAAC CCGCAATGTG 2040

AGCATCCCTG CCCTCAACGA CAGCAAGTAC ATCGGCATGA GTGTGTACAA TGTGGGGATC 2100

ATGTGCATCA TCGGGGCTGC TGTCTCCTTC CTGACGCGTG ACCAGCCCAA CGTGCAGTTC 2160

TGCATCGTGG CCCTGGTCAT CATCTTCTGC AGCACCATCA CTCTCTGCCT GGTGTTTGTG 2220

CCAAAGCTCA TTACTCTGAG GACAAACCCT GACGCAGCCA CTCAGAACAG GCGGTTCCAG 2280

TTCACACAGA ACCAGAAGAA AGAAGATTCG AAGACCTCCA CTTCAGTCAC CAGCGTGAAC 2340

CAGGCGAGCA CGTCACGCCT GGAGGGACTG CAGTCAGAAA ACCACCGCCT TCGAATGAAG 2400

ATCACAGAGC TGGACAAAGA CTTGGAAGAA GTCACCATGC AGCTACAAGA CACACCAGAG 2460

AAGACCACAT ACATCAAACA GAATCACTAC CAAGAGCTCA ACGACATCCT CAGCTTGGGC 2520

AACTTCACAG AGAGCACAGA TGGAGGAAAG GCCATTCTAA AAAATCACCT CGATCAAAAC 2580

CCCCAGCTCC AGTGGAACAC GACAGAGCCC TCAAGAACAT GCAAAGACCC CATAGAAGAC 2640

ATCAACTCCC CGGAGCACAT CCAGCGCCGG CTGTCGCTCC AGCTCCCCAT CCTTCACCAC 2700

GCCTACCTCC CATCCATCGG AGGCGTGGAT GCCAGCTGCG TCAGCCCCTG TGTCAGCCCT 2760

ACCGCCAGCC CTCGCCACAG ACACGTACCA CCCTCCTTCC GAGTCATGGT CTCGGGCCTG 2820

TAG 2823

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 940 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Ala Ser Pro Pro Ser Ser Gly Gln Pro Arg Pro Pro Pro Pro Pro
1 5 10 15

Pro Pro Pro Ala Arg Leu Leu Leu Pro Leu Leu Leu Ser Leu Leu Leu
20 25 30

Trp Leu Ala Pro Gly Ala Trp Gly Trp Thr Arg Gly Ala Pro Arg Pro
35 40 45

Pro Pro Ser Ser Pro Pro Leu Ser Ile Met Gly Leu Met Pro Leu Thr
50 55 60

Lys Glu Val Ala Lys Gly Ser Ile Gly Arg Gly Val Leu Pro Ala Val
65 70 75 80

Glu Leu Ala Ile Glu Gln Ile Arg Asn Glu Ser Leu Leu Arg Pro Tyr
85 90 95

Phe Leu Asp Leu Arg Leu Tyr Asp Thr Glu Cys Asp Asn Ala Lys Gly
100 105 110

Leu Lys Ala Phe Tyr Asp Ala Ile Lys Tyr Gly Pro Asn His Leu Met
115 120 125

Val Phe Gly Gly Val Cys Pro Ser Val Thr Ser Ile Ile Ala Glu Ser
130 135 140

Leu Gln Gly Trp Asn Leu Val Gln Leu Ser Phe Ala Ala Thr Thr Pro
145 150 155 160

Val Leu Ala Asp Lys Lys Lys Tyr Pro Tyr Phe Phe Arg Thr Val Pro
165 170 175

Ser Asp Asn Ala Val Asn Pro Ala Ile Leu Lys Leu Leu Lys His Phe
180 185 190

Arg Trp Arg Arg Val Gly Thr Leu Thr Gln Asp Val Gln Arg Phe Ser
195 200 205

Glu Val Arg Asn Asp Leu Thr Gly Val Leu Tyr Gly Glu Asp Ile Glu
210 215 220

Ile Ser Asp Thr Glu Ser Phe Ser Asn Asp Pro Cys Thr Ser Val Lys
225 230 235 240

Lys Leu Lys Gly Asn Asp Val Arg Ile Ile Leu Gly Gln Phe Asp Gln
245 250 255

Asn Met Ala Ala Lys Val Phe Cys Cys Ala Phe Glu Glu Ser Met Phe
260 265 270

Gly Ser Lys Tyr Gln Trp Ile Ile Pro Gly Trp Tyr Glu Pro Ala Trp
275 280 285

Trp Glu Gln Val His Val Glu Ala Asn Ser Ser Arg Cys Leu Arg Arg
290 295 300

Ser Leu Leu Ala Ala Met Glu Gly Tyr Ile Gly Val Asp Phe Glu Pro
305 310 315 320

Leu Ser Ser Lys Gln Ile Lys Thr Ile Ser Gly Lys Thr Pro Gln Gln
325 330 335

Tyr Glu Arg Glu Tyr Asn Ser Lys Arg Ser Gly Val Gly Pro Ser Lys
340 345 350

Phe His Gly Tyr Ala Tyr Asp Gly Ile Trp Val Ile Ala Lys Thr Leu
355 360 365

Gln Arg Ala Met Glu Thr Leu His Ala Ser Ser Arg His Gln Arg Ile
370 375 380

Gln Asp Phe Asn Tyr Thr Asp His Thr Leu Gly Lys Ile Ile Leu Asn
385 390 395 400

Ala Met Asn Glu Thr Asn Phe Phe Gly Val Thr Gly Gln Val Val Phe
405 410 415

Arg Asn Gly Glu Arg Met Gly Thr Ile Lys Phe Thr Gln Phe Gln Asp
420 425 430

Ser Arg Glu Val Lys Val Gly Glu Tyr Asn Ala Val Ala Asp Thr Leu
435 440 445

Glu Ile Ile Asn Asp Thr Ile Arg Phe Gln Gly Ser Glu Pro Pro Lys
450 455 460

Asp Lys Thr Ile Ile Leu Glu Gln Leu Arg Lys Ile Ser Leu Pro Leu
465 470 475 480

Tyr Ser Ile Leu Ser Ala Leu Thr Ile Leu Gly Met Ile Met Ala Ser
485 490 495

Ala Phe Leu Phe Phe Asn Ile Lys Asn Arg Asn Gln Lys Leu Ile Lys
500 505 510

Met Ser Ser Pro Tyr Met Asn Asn Leu Ile Ile Leu Gly Gly Met Leu
515 520 525

Ser Tyr Ala Ser Ile Phe Leu Phe Gly Leu Asp Gly Ser Phe Val Ser
530 535 540

Glu Lys Thr Phe Glu Thr Leu Cys Thr Val Arg Thr Trp Ile Leu Thr
545 550 555 560

Val Gly Tyr Thr Thr Ala Phe Gly Ala Met Phe Ala Lys Thr Trp Arg
565 570 575

Val His Ala Ile Phe Lys Asn Val Lys Met Lys Lys Lys Ile Ile Lys
580 585 590

Asp Gln Lys Leu Leu Val Ile Val Gly Gly Met Leu Leu Ile Asp Leu
595 600 605

Cys Ile Leu Ile Cys Trp Gln Ala Val Asp Pro Leu Arg Arg Thr Val
610 615 620

Glu Arg Tyr Ser Met Glu Pro Asp Pro Ala Gly Arg Asp Ile Ser Ile
625 630 635 640

Arg Pro Leu Leu Glu His Cys Glu Asn Thr His Met Thr Ile Trp Leu
645 650 655

Gly Ile Val Tyr Ala Tyr Lys Gly Leu Leu Met Leu Phe Gly Cys Phe
660 665 670

Leu Ala Trp Glu Thr Arg Asn Val Ser Ile Pro Ala Leu Asn Asp Ser
675 680 685

Lys Tyr Ile Gly Met Ser Val Tyr Asn Val Gly Ile Met Cys Ile Ile
690 695 700

Gly Ala Ala Val Ser Phe Leu Thr Arg Asp Gln Pro Asn Val Gln Phe
705 710 715 720

Cys Ile Val Ala Leu Val Ile Ile Phe Cys Ser Thr Ile Thr Leu Cys
725 730 735

Leu Val Phe Val Pro Lys Leu Ile Thr Leu Arg Thr Asn Pro Asp Ala
740 745 750

Ala Thr Gln Asn Arg Arg Phe Gln Phe Thr Gln Asn Gln Lys Lys Glu
755 760 765

Asp Ser Lys Thr Ser Thr Ser Val Thr Ser Val Asn Gln Ala Ser Thr
770 775 780

Ser Arg Leu Glu Gly Leu Gln Ser Glu Asn His Arg Leu Arg Met Lys
785 790 795 800

Ile Thr Glu Leu Asp Lys Asp Leu Glu Glu Val Thr Met Gln Leu Gln
805 810 815

Asp Thr Pro Glu Lys Thr Thr Tyr Ile Lys Gln Asn His Tyr Gln Glu
820 825 830

Leu Asn Asp Ile Leu Ser Leu Gly Asn Phe Thr Glu Ser Thr Asp Gly
835 840 845

Gly Lys Ala Ile Leu Lys Asn His Leu Asp Gln Asn Pro Gln Leu Gln
850 855 860

Trp Asn Thr Thr Glu Pro Ser Arg Thr Cys Lys Asp Pro Ile Glu Asp
865 870 875 880

Ile Asn Ser Pro Glu His Ile Gln Arg Arg Leu Ser Leu Gln Leu Pro
885 890 895

Ile Leu His His Ala Tyr Leu Pro Ser Ile Gly Gly Val Asp Ala Ser
900 905 910

Cys Val Ser Pro Cys Val Ser Pro Thr Ala Ser Pro Arg His Arg His
915 920 925

Val Pro Pro Ser Phe Arg Val Met Val Ser Gly Leu
930 935 940
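
The amino acid sequence of SEQ ID NO: 4 can be cross-checked against the nucleotide sequence of SEQ ID NO: 3 by translating codons. The following is a minimal sketch, not part of the listing itself: it assumes the standard genetic code and reading frame 1 starting at the initial ATG, and the names `CODON_TABLE`, `translate`, and `SEQ3_FIRST_ROW` are illustrative only. It translates the first 60 bases of SEQ ID NO: 3, which should correspond to residues 1 to 20 of SEQ ID NO: 4 (Met Ala Ser Pro Pro Ser Ser Gly Gln Pro Arg followed by eight Pro and then Ala).

```python
# Standard genetic code, codons enumerated in T, C, A, G order
# (assumption: the standard code applies to this sequence).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 to one-letter amino acids ('*' = stop).

    Whitespace between the ten-base groups is ignored; a trailing
    partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First row (bases 1-60) of SEQ ID NO: 3 as printed in the listing.
SEQ3_FIRST_ROW = (
    "ATGGCTTCCC CGCCGAGCTC CGGGCAGCCC CGGCCGCCGC CGCCGCCGCC GCCGCCCGCG"
)

print(translate(SEQ3_FIRST_ROW))  # one-letter codes for residues 1-20
```

The same function can be applied row by row to the rest of SEQ ID NO: 3; any codon containing a character other than A, C, G, or T raises a `KeyError`, which is a convenient way to locate damaged positions in a transcribed sequence.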



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
AGGGATGCTT TCCTATGCTT CCATATTTCT CTTTGGCCTT GATGG 45



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CAATGTGCAG TTCTGCATCG TGGCTCTGGT CATCATCTTC TGCAG 45



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
CTTCTAGGCC TGTACGGAAG TGTT 24



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
GTTGTGGTTT GTCCAAACTC ATCAAT 26



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
GGGATGAGTG TCTACAACGT GGGG



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TGCGTTGCTG CATCTGGGTT TGTTCT 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
ATCTCCCTAC CTCTCTACAG CATCCT 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CAGGTCCTGA CGGTGCAAAG TGTTTC 



(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TGACGCAAGA CGTTCAGAGG TTCTCT 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TGTAGCCTTC CATGGCAGCA AGCAGA 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AGAGAACCTC TGAACGTCTT GCGTCA 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 



(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GGCTCTGTTG TGTTCCACTG TAGCTG 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
TCATGCCGCT CACCAAGGAG GTGGCC 



(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
GGCCACCTCC TTGGTGAGCG GCATGA 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
TGAGTGAGCA GAGTCCAGAG CCGT



(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
ATGGATGGGA GGTAGGCGTG GTGGAG 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
CTCTCTGCCC TCACCATCCT CGGGAT 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:
GACTCCGGCT CGAATACCAG GCAGAG 



(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CCATGTTTGC AAAGACCTGG AGGGTCC 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GGTCACGCGT CAGGAAAGAG ACAGCAG 



(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AAGCTTCTAG AGATCCCTCG ACCTC 



(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:
AGGCGCAGAA CTGGTAGGTA TGGAA 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
CTTCTAGGCC TGTACGGAAG TGTTA 



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
GTTGTGGTTT GTCCAAACTC ATCAATG 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:
CTGCTGTCTC TTTCCTGACG CGTGACC



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 59 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CCAAGCTTCT AATACGACTC ACTATAGGGG AGACCATGGG CCCGGGGGGA CCCTGTACC 



(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:
TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTCACTT GTAAAGCAAA TGTACTCGAC 
TCC 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:



CGCGGATCtA TTATGTCTGC ACTCCGAAGG AAATTTG 



(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:
CGCGAATTCT TATGTGAAGC GATCAGAGTT CATTTTTC 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
GCGGGATCCG CTATGGCTGG TGATTCTAGG AATG 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:
CCGGAATTCC CCTCACACCG AGCCCCTGG 



(2) INFORMATION FOR SEQ ID NO: 36:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 44 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GCAATAAAGT ATGGGCTGAA CCATTTGATG GTGTTTGGAG GCGT

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37:
ACGCCTCCAA ACACCATCAA ATGGTTCAGC CCATACTTTA TTGC 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
TTTGAGCCCC TGAGCTCCAA ACAAATCAAG ACCATCTCAG 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CTGAGATGGT CTTGATTTGT TTGGAGCTCA GGGGCTCAAA 
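
SEQ ID NO: 38 is marked ANTI-SENSE: NO and SEQ ID NO: 39 is marked ANTI-SENSE: YES, and the two printed sequences appear to be reverse complements of each other. The sketch below is an illustration only and not part of the listing; the helper name `reverse_complement` and the constants `SEQ38` and `SEQ39` are hypothetical. It shows one way to confirm such a sense/antisense pairing directly from the text as printed.

```python
# Map each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement, ignoring spaces between base groups."""
    seq = "".join(dna.split()).upper()
    return seq.translate(COMPLEMENT)[::-1]


# Sequences copied from the listing entries for SEQ ID NO: 38 and 39.
SEQ38 = "TTTGAGCCCC TGAGCTCCAA ACAAATCAAG ACCATCTCAG"
SEQ39 = "CTGAGATGGT CTTGATTTGT TTGGAGCTCA GGGGCTCAAA"

is_pair = reverse_complement(SEQ38) == "".join(SEQ39.split())
print(is_pair)
```

The same comparison can be run on the other entries that alternate ANTI-SENSE: NO and ANTI-SENSE: YES; a mismatch at a single position would point to either a transcription error or an intended difference between the two oligonucleotides.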



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
AAGGCCATCA ACTTCCTGCC TGTGGACTAT GAGATCGAAT ATG 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CATATTCGAT CTCATAGTCC ACAGGCAGGA AGTTGATGGC CTT 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TGGCCGCTGC CTCTTCTGCT GGTGATGGCG GCTGGGGT 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
ACCCCAGCCG CCATCACCAG CAGAAGAGGC AGCGGCCA 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
CCTTGCCTTT GGCCTTGAAC AAGACGTCTG GAGGAGGTGG TCGTT 45 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 
AACGACCACC TCCTCCAGAC GTCTTGTTCA AGGCCAAAGC CAAGG 45 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

ATGGCTTCCC CGCGGAGCTC CGGGCAGCCC GGGCCGCCGC CGCCGCCGCC ACCGCCGCCC 60 

GCGCGCCTGC TACTGCTACT GCTGCTGCCG CTGCTGCTGC CTCTGGCGCC CGGGGCCTGG 120 

GGCTGGGCGC GGGGCGCCCC CCGGCCGCCG CCCAGCAGCC CGCCGCTCTC CATCATGGGC 180 

CTCATGCCGC TCACCAAGGA GGTGGCCAAG GGCAGCATCG GGCGCGGTGT GCTCCCCGCC 240 

GTGGAACTGG CCATCGAGCA GATCCGCAAC GAGTCACTCC TGCGCCCCTA CTTCCTCGAC 300 

CTGCGGCTCT ATGACACGGA GTGCGACAAC GCAAAAGGGT TGAAAGCCTT CTACGATGCG 360 

ATAAAATACG GGCCGAACCA CTTGATGGTG TTTGGAGGCG TCTGTCCATC CGTCACATCC 420 

ATCATTGCAG AGTCCCTCCA AGGCTGGAAT CTGGTGCAGC TTTCTTTTGC TGCAACCACG 480 

CCTGTTCTAG CCGATAAGAA AAAATACCCT TATTTCTTTC GGACCGTCCC ATCAGACAAT 540 

GCGGTGAATC CAGCCATTCT GAAGTTGCTC AAGCACTACC AGTGGAAGCG CGTGGGCACG 600 

CTGACGCAAG ACGTTCAGAG GTTCTCTGAG GTGCGGAATG ACCTGACTGG AGTTCTGTAT 660 

GGCGAGGACA TTGAGATTTC AGACACCGAG AGCTTCTCCA ACGATCCCTG TACCAGTGTC 720 

AAAAAGCTGA AGGGGAATGA TGTGCGGATC ATCCTTGGCC AGTTTGACCA GAATATGGCA 780 

GCAAAAGTGT TCTGTTGTGC ATACGAGGAG AACATGTATG GTAGTAAATA TCAGTGGATC 840 

ATTCCGGGCT GGTACGAGCC TTCTTGGTGG GAGCAGGTGC ACACGGAAGC CAACTCATCC 900 

CGCTGCCTCC GGAAGAATCT GCTTGCTGCC ATGGAGGGCT ACATTGGCGT GGATTTCGAG 960 

CCCCTGAGCT CCAAGCAGAT CAAGACCATC TCAGGAAAGA CTCCACAGCA GTATGAGAGA 1020 

GAGTACAACA ACAAGCGGTC AGGCGTGGGG CCCAGCAAGT TCCACGGGTA CGCCTACGAT 1080 

GGCATCTGGG TCATCGCCAA GACACTGCAG AGGGCCATGG AGACACTGCA TGCCAGCAGC 1140 

CGGCACCAGC GGATCCAGGA CTTCAACTAC ACGGACCACA CGCTGGGCAG GATCATCCTC 1200 

AATGCCATGA ACGAGACCAA CTTCTTCGGG GTCACGGGTC AAGTTGTATT CCGGAATGGG 1260 

GAGAGAATGG GGACCATTAA ATTTACTCAA TTTCAAGACA GCAGGGAGGT GAAGGTGGGA 1320 

GAGTACAACG CTGTGGCCGA CACACTGGAG ATCATCAATG ACACCATCAG GTTCCAAGGA 1380 

TCCGAACCAC CAAAAGACAA GACCATCATC CTGGAGCAGC TGCGGAAGAT CTCCCTACCT 1440 

CTTTACAGCA TCCTCTCTGC CCTCACCATC CTCGGGATGA TCATGGCCAG TGCTTTTCTC 1500 

           TCAAGAACCG GAATCAGAAG CTCATAAAGA TGTCGAGTCC ATACATGAAC 1560 

           TCCTTGGAGG GATGCTTTCC TATGCTTCCA TATTTCTCTT TGGCCTTGAT 1620 

           TCTCTGAAAA GACCTTTGAA ACACTTTGCA CCGTCAGGAC CTGGATTCTC 1680 

ACCGTGGGCT ACACGACCGC TTTTGGGGCC ATGTTTGCAA AGACCTGGAG AGTCCACGCC 1740 

ATCTTCAAAA ATGTGAAAAT GAAGAAGAAG ATCATCAAGG ACCAGAAACT GCTTGTGATC 1800 

GTGGGGGGCA TGCTGCTGAT CGACCTGTGT ATCCTGATCT GCTGGCAGGC TGTGGACCCC 1860 

CTGCGAAGGA CAGTGGAGAA GTACAGCATG GAGCCGGACC CAGCAGGACG GGATATCTCC 1920 

ATCCGCCCTC TCCTGGAGCA CTGTGAGAAC ACCCATATGA CCATCTGGCT TGGCATCGTC 1980 

TATGCCTACA AGGGACTTCT CATGTTGTTC GGTTGTTTCT TAGCTTGGGA GACCCGCAAC 2040 

GTCAGCATCC CCGCACTCAA CGACAGCAAG TACATCGGGA TGAGTGTCTA CAACGTGGGG 2100 

ATCATGTGCA TCATCGGGGC CGCTGTCTCC TTCCTGACCC GGGACCAGCC CAATGTGCAG 2160 

TTCTGCATCG TGGCTCTGGT CATCATCTTC TGCAGCACCA TCACCCTCTG CCTGGTATTC 2220 

GTGCCGAAGC TCATCACCCT GAGAACAAAC CCAGATGCAG CAACGCAGAA CAGGCGATTC 2280 

CAGTTCACTC AGAATCAGAA GAAAGAAGAT TCTAAAACGT CCACCTCGGT CACCAGTGTG 2340 

AACCAAGCCA GCACATCCCG CCTGGAGGGC CTACAGTCAG AAAACCATCG CCTGCGAATG 2400 

AAGATCACAG AGCTGGATAA AGACTTGGAA GAGGTCACCA TGCAGCTGCA GGACACACCA 2460 

GAAAAGACCA CCTACATTAA ACAGAACCAC TACCAAGAGC TCAATGACAT CCTCAACCTG 2520 

GGAAACTTTA CTGAGAGCAC AGATGGAGGA AAGGCCATTT TAAAAAATCA CCTCGATCAA 2580 

AATCCCCAGC TACAGTGGAA CACAACAGAG CCCTCTCGAA CATGCAAAGA TCCTATAGAA 2640 

GATATAAACT CTCCAGAACA CATCCAGCGT CGGCTGTCCC TCCAGCTCCC CATCCTCCAC 2700 

CACGCCTACC TCCCATCCAT CGGAGGCGTG GACGCCAGCT GTGTCAGCCC CTGCGTCAGC 2760 

CCCACCGCCA GCCCCCGCCA CAGACATGTG CCACCCTCCT TCCGAGTCAT GGTCTCGGGC 2820 

CTGTAA 2826 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



Met Ala Ser Pro Arg Ser Ser Gly Gln Pro Gly Pro Pro Pro Pro Pro 
1 5 10 15 

Pro Pro Pro Pro Ala Arg Leu Leu Leu Leu Leu Leu Leu Pro Leu Leu 
20 25 30 

Leu Pro Leu Ala Pro Gly Ala Trp Gly Trp Ala Arg Gly Ala Pro Arg 
35 40 45 

Pro Pro Pro Ser Ser Pro Pro Leu Ser Ile Met Gly Leu Met Pro Leu 
50 55 60 

Thr Lys Glu Val Ala Lys Gly Ser Ile Gly Arg Gly Val Leu Pro Ala 
65 70 75 80 

Val Glu Leu Ala Ile Glu Gln Ile Arg Asn Glu Ser Leu Leu Arg Pro 
85 90 95 

Tyr Phe Leu Asp Leu Arg Leu Tyr Asp Thr Glu Cys Asp Asn Ala Lys 
100 105 110 

Gly Leu Lys Ala Phe Tyr Asp Ala Ile Lys Tyr Gly Pro Asn His Leu 
115 120 125 

Met Val Phe Gly Gly Val Cys Pro Ser Val Thr Ser Ile Ile Ala Glu 
130 135 140 

Ser Leu Gln Gly Trp Asn Leu Val Gln Leu Ser Phe Ala Ala Thr Thr 
145 150 155 160 

Pro Val Leu Ala Asp Lys Lys Lys Tyr Pro Tyr Phe Phe Arg Thr Val 
165 170 175 

Pro Ser Asp Asn Ala Val Asn Pro Ala Ile Leu Lys Leu Leu Lys His 
180 185 190 

Tyr Gln Trp Lys Arg Val Gly Thr Leu Thr Gln Asp Val Gln Arg Phe 
195 200 205 

Ser Glu Val Arg Asn Asp Leu Thr Gly Val Leu Tyr Gly Glu Asp Ile 
210 215 220 

Glu Ile Ser Asp Thr Glu Ser Phe Ser Asn Asp Pro Cys Thr Ser Val 
225 230 235 240 

Lys Lys Leu Lys Gly Asn Asp Val Arg Ile Ile Leu Gly Gln Phe Asp 
245 250 255 

Gln Asn Met Ala Ala Lys Val Phe Cys Cys Ala Tyr Glu Glu Asn Met 
260 265 270 

Tyr Gly Ser Lys Tyr Gln Trp Ile Ile Pro Gly Trp Tyr Glu Pro Ser 
275 280 285 

Trp Trp Glu Gln Val His Thr Glu Ala Asn Ser Ser Arg Cys Leu Arg 
290 295 300 

Lys Asn Leu Leu Ala Ala Met Glu Gly Tyr Ile Gly Val Asp Phe Glu 
305 310 315 320 

Pro Leu Ser Ser Lys Gln Ile Lys Thr Ile Ser Gly Lys Thr Pro Gln 
325 330 335 

Gln Tyr Glu Arg Glu Tyr Asn Asn Lys Arg Ser Gly Val Gly Pro Ser 
340 345 350 

Lys Phe His Gly Tyr Ala Tyr Asp Gly Ile Trp Val Ile Ala Lys Thr 
355 360 365 

Leu Gln Arg Ala Met Glu Thr Leu His Ala Ser Ser Arg His Gln Arg 
370 375 380 

Ile Gln Asp Phe Asn Tyr Thr Asp His Thr Leu Gly Arg Ile Ile Leu 
385 390 395 400 

Asn Ala Met Asn Glu Thr Asn Phe Phe Gly Val Thr Gly Gln Val Val 
405 410 415 

Phe Arg Asn Gly Glu Arg Met Gly Thr Ile Lys Phe Thr Gln Phe Gln 
420 425 430 

Asp Ser Arg Glu Val Lys Val Gly Glu Tyr Asn Ala Val Ala Asp Thr 
435 440 445 

Leu Glu Ile Ile Asn Asp Thr Ile Arg Phe Gln Gly Ser Glu Pro Pro 
450 455 460 

Lys Asp Lys Thr Ile Ile Leu Glu Gln Leu Arg Lys Ile Ser Leu Pro 
465 470 475 480 

Leu Tyr Ser Ile Leu Ser Ala Leu Thr Ile Leu Gly Met Ile Met Ala 
485 490 495 

Ser Ala Phe Leu Phe Phe Asn Ile Lys Asn Arg Asn Gln Lys Leu Ile 
500 505 510 

Lys Met Ser Ser Pro Tyr Met Asn Asn Leu Ile Ile Leu Gly Gly Met 
515 520 525 

Leu Ser Tyr Ala Ser Ile Phe Leu Phe Gly Leu Asp Gly Ser Phe Val 
530 535 540 

Ser Glu Lys Thr Phe Glu Thr Leu Cys Thr Val Arg Thr Trp Ile Leu 
545 550 555 560 

Thr Val Gly Tyr Thr Thr Ala Phe Gly Ala Met Phe Ala Lys Thr Trp 
565 570 575 

Arg Val His Ala Ile Phe Lys Asn Val Lys Met Lys Lys Lys Ile Ile 
580 585 590 

Lys Asp Gln Lys Leu Leu Val Ile Val Gly Gly Met Leu Leu Ile Asp 
595 600 605 

Leu Cys Ile Leu Ile Cys Trp Gln Ala Val Asp Pro Leu Arg Arg Thr 
610 615 620 

Val Glu Lys Tyr Ser Met Glu Pro Asp Pro Ala Gly Arg Asp Ile Ser 
625 630 635 640 

Ile Arg Pro Leu Leu Glu His Cys Glu Asn Thr His Met Thr Ile Trp 
645 650 655 

Leu Gly Ile Val Tyr Ala Tyr Lys Gly Leu Leu Met Leu Phe Gly Cys 
660 665 670 

Phe Leu Ala Trp Glu Thr Arg Asn Val Ser Ile Pro Ala Leu Asn Asp 
675 680 685 

Ser Lys Tyr Ile Gly Met Ser Val Tyr Asn Val Gly Ile Met Cys Ile 
690 695 700 

Ile Gly Ala Ala Val Ser Phe Leu Thr Arg Asp Gln Pro Asn Val Gln 
705 710 715 720 

Phe Cys Ile Val Ala Leu Val Ile Ile Phe Cys Ser Thr Ile Thr Leu 
725 730 735 

Cys Leu Val Phe Val Pro Lys Leu Ile Thr Leu Arg Thr Asn Pro Asp 
740 745 750 

Ala Ala Thr Gln Asn Arg Arg Phe Gln Phe Thr Gln Asn Gln Lys Lys 
755 760 765 

Glu Asp Ser Lys Thr Ser Thr Ser Val Thr Ser Val Asn Gln Ala Ser 
770 775 780 

Thr Ser Arg Leu Glu Gly Leu Gln Ser Glu Asn His Arg Leu Arg Met 
785 790 795 800 

Lys Ile Thr Glu Leu Asp Lys Asp Leu Glu Glu Val Thr Met Gln Leu 
805 810 815 

Gln Asp Thr Pro Glu Lys Thr Thr Tyr Ile Lys Gln Asn His Tyr Gln 
820 825 830 

Glu Leu Asn Asp Ile Leu Asn Leu Gly Asn Phe Thr Glu Ser Thr Asp 
835 840 845 

Gly Gly Lys Ala Ile Leu Lys Asn His Leu Asp Gln Asn Pro Gln Leu 
850 855 860 

Gln Trp Asn Thr Thr Glu Pro Ser Arg Thr Cys Lys Asp Pro Ile Glu 
865 870 875 880 

Asp Ile Asn Ser Pro Glu His Ile Gln Arg Arg Leu Ser Leu Gln Leu 
885 890 895 

Pro Ile Leu His His Ala Tyr Leu Pro Ser Ile Gly Gly Val Asp Ala 
900 905 910 

Ser Cys Val Ser Pro Cys Val Ser Pro Thr Ala Ser Pro Arg His Arg 
915 920 925 

His Val Pro Pro Ser Phe Arg Val Met Val Ser Gly Leu 
930 935 940 
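
As a consistency check between SEQ ID NO: 46 and SEQ ID NO: 47, the first 60 bases of SEQ ID NO: 46 can be translated and compared with the first 20 residues of SEQ ID NO: 47. The sketch below assumes the standard genetic code applies; the names `CODON_TABLE` and `translate` are illustrative and are not part of the listing.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (an assumption
# of this sketch; the listing itself does not state which code was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate a DNA string in frame 1 into one-letter amino acid codes."""
    seq = "".join(dna.split()).upper()      # drop block spacing
    usable = len(seq) - len(seq) % 3        # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# Bases 1-60 of SEQ ID NO: 46.
FIRST_ROW = "ATGGCTTCCC CGCGGAGCTC CGGGCAGCCC GGGCCGCCGC CGCCGCCGCC ACCGCCGCCC"

peptide = translate(FIRST_ROW)
print(peptide)
```

The printed one-letter string corresponds to Met Ala Ser Pro Arg Ser Ser Gly Gln Pro Gly followed by nine Pro residues, which agrees with residues 1 to 20 of SEQ ID NO: 47.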



(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Pro Leu Tyr Ser Ile Leu Ser Ala Leu Thr Ile Leu Gly Met Ile Met 
1 5 10 15 

Ala Ser Ala Phe Leu Phe Phe Asn Ile Lys Asn 
20 25 



(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

Leu Ile Ile Leu Gly Gly Met Leu Ser Tyr Ala Ser Ile Phe Leu Phe 
1 5 10 15 

Gly Leu Asp Gly Ser Phe Val Ser Glu Lys Thr 
20 25 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Cys Thr Val Arg Thr Trp Ile Leu Thr Val Gly Tyr Thr Thr Ala Phe 
1 5 10 15 

Gly Ala Met Phe Ala Lys Thr Trp Arg 
20 25 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Gln Lys Leu Leu Val Ile Val Gly Gly Met Leu Leu Ile Asp Leu Cys 
1 5 10 15 

Ile Leu Ile Cys Trp Gln 
20 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Met Thr Ile Trp Leu Gly Ile Val Tyr Ala Tyr Lys Gly Leu Leu Met 
1 5 10 15 

Leu Phe Gly Cys Phe Leu Ala Trp 
20 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Ala Leu Asn Asp Ser Lys Tyr Ile Gly Met Ser Val Tyr Asn Val Gly 
1 5 10 15 

Ile Met Cys Ile Ile Gly Ala Ala Val 
20 25 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Cys Ile Val Ala Leu Val Ile Ile Phe Cys Ser Thr Ile Thr Leu Cys 
1 5 10 15 

Leu Val Phe Val Pro Lys Leu Ile Thr Leu Arg Thr Asn 
20 25 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Gly Pro Gly Gly Pro Cys Thr Pro Val Gly Trp Pro Leu Pro Leu 
1 5 10 15 

Leu Leu Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 30 

Pro His Leu Pro Arg Pro His Pro Arg Val Pro Pro His Pro Ser Ser 
35 40 45 

Glu Arg Arg Ala Val Tyr Ile Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

Trp Pro Gly Gly Gln Ala Cys Gln Pro Ala Val Glu Met Ala Leu Glu 
65 70 75 80 

Asp Val Asn Ser Arg Arg Asp Ile Leu Pro Asp Tyr Glu Leu Lys Leu 
85 90 95 

Ile His His Asp Ser Lys Cys Asp Pro Gly Gln Ala Thr Lys Tyr Leu 
100 105 110 

Tyr Glu Leu Leu Tyr Asn Asp Pro Ile Lys Ile Ile Leu Met Pro Gly 
115 120 125 

Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 

Leu Ile Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
145 150 155 160 

Gln Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His 
165 170 175 

Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys Ile 
180 185 190 

Ala Thr Ile Gln Gln Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
195 200 205 

Leu Glu Glu Arg Val Lys Glu Ala Gly Ile Glu Ile Thr Phe Arg Gln 
210 215 220 

Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gln 
225 230 235 240 

Asp Ala Arg Ile Ile Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
245 250 255 

Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
260 265 270 

Trp Phe Leu Ile Gly Trp Tyr Ala Asp Asn Trp Phe Lys Thr Tyr Asp 
275 280 285 

Pro Ser Ile Asn Cys Thr Val Glu Glu Met Thr Glu Ala Val Glu Gly 
290 295 300 

His Ile Thr Thr Glu Ile Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 310 315 320 

Ile Ser Asn Met Thr Ser Gln Glu Phe Val Glu Lys Leu Thr Lys Arg 
325 330 335 

Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gln Glu Ala Pro Leu 
340 345 350 

Ala Tyr Asp Ala Ile Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
355 360 365 

Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
370 375 380 

Asn Gln Thr Ile Thr Asp Gln Ile Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 400 

Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
405 410 415 

Met Ala Trp Thr Leu Ile Glu Gln Leu Gln Gly Gly Ser Tyr Lys Lys 
420 425 430 

Ile Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 

Asp Lys Trp Ile Gly Gly Ser Pro Pro Ala Asp Gln Thr Leu Val Ile 
450 455 460 

Lys Thr Phe Arg Phe Leu Ser Gln Lys Leu Phe Ile Ser Val Ser Val 
465 470 475 480 

Leu Ser Ser Leu Gly Ile Val Leu Ala Val Val Cys Leu Ser Phe Asn 
485 490 495 

Ile Tyr Asn Ser His Val Arg Tyr Ile Gln Asn Ser Gln Pro Asn Leu 
500 505 510 

Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
515 520 525 

Pro Leu Gly Leu Asp Gly Tyr His Ile Gly Arg Ser Gln Phe Pro Phe 
530 535 540 

Val Cys Gln Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly 
545 550 555 560 

Tyr Gly Ser Met Phe Thr Lys Ile Trp Trp Val His Thr Val Phe Thr 
565 570 575 

Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
580 585 590 

Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
595 600 605 

Ala Ile Trp Gln Ile Val Asp Pro Leu His Arg Thr Ile Glu Thr Phe 
610 615 620 

Ala Lys Glu Glu Pro Lys Glu Asp Ile Asp Val Ser Ile Leu Pro Gln 
625 630 635 640 

Leu Glu His Cys Ser Ser Lys Lys Met Asn Thr Trp Leu Gly Ile Phe 
645 650 655 

Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly Ile Phe Leu Ala Tyr 
660 665 670 

Glu Thr Lys Ser Val Ser Thr Glu Lys Ile Asn Asp His Arg Ala Val 
675 680 685 

Gly Met Ala Ile Tyr Asn Val Ala Val Leu Cys Leu Ile Thr Ala Pro 
690 695 700 

Val Thr Met Ile Leu Ser Ser Gln Gln Asp Ala Ala Phe Ala Phe Ala 
705 710 715 720 

Ser Leu Ala Ile Val Phe Ser Ser Tyr Ile Thr Leu Val Val Leu Phe 
725 730 735 

Val Pro Lys Met Arg Arg Leu Ile Thr Arg Gly Glu Trp Gln Ser Glu 
740 745 750 

Thr Gln Asp Thr Met Lys Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu 
755 760 765 

Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu Leu Glu Lys Ile 
770 775 780 

Ile Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg His Gln Leu Gln 
785 790 795 800 

Ser Arg Gln Gln Leu Arg Ser Arg Arg His Pro Pro Thr Pro Pro Asp 
805 810 815 

Pro Ser Gly Gly Leu Pro Arg Gly Pro Ser Glu Pro Pro Asp Arg Leu 
820 825 830 

Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys 
835 840 



